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H ' Abstract 

An important property of the Kalman filter is that the underlying Riccati flow is a 
contraction for the natural metric of the cone of symmetric positive deflnite matrices. 
^\ , The present paper studies the geometry of a low-rank version of the Kalman fllter. 

The underlying Riccati flow evolves on the manifold of fixed rank symmetric positive 
semideflnite matrices. Contraction properties of the low-rank flow are studied by means 
of a suitable metric recently introduced by the authors. 
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1 Introduction 



2 . The Kalman-Bucy filter [15] (KF) is a very popular method in engineering, that allows to 

compute an estimate of the state of a dynamical system from several sensors measurements, 
possibly corrupted by measurement's noise. The principle is to merge predictions from a 
trusted model of the dynamics of the system with measurements, in order to 1- efficiently 

^ ] filter the noise and 2- get an accurate estimate of the (unknown) internal state of the system 

On ■ in real time. The estimation relies on the computation at each time of a positive semi-definite 

'nI I matrix P, that represents the covariance of the estimation error when the noises are assumed 

to be white and Gaussians. The differential equation verified by P is a matrix Riccati 
equation. The Kalman filter is designed for linear dynamical systems, but when the system's 
trusted equations are non-linear, the extended Kalman filter (EKF) is a natural extension 
of the Kalman filter based on a linearization of the equations around each estimate. In 
this case, P verifies the same equation but with time-varying coefficients, and the computed 
positive semi-definite matrix is only an approximation of the true error covariance matrix. 
The stability analysis, both for the KF and EKF, generally relies on results on the Riccati 
flow on the set of positive semi-definite matrices, see [ISl [^[5]. 



JH ' For non-linear systems, such as the one described by the aforementioned Riccati diffcrcn- 

■ - - tial equation, the contraction property may prove very useful. It essentially requires that the 

distance between two arbitrary solutions does not expand. When the contraction is strict, 
all solutions exponentially converge towards each other. Although a stronger mathematical 
property than global convergence to a particular solution, as it implies exponential con- 
vergence with a specified rate, it is sometimes an easier property to prove. In the control 
community it has been popularized by Lohmiler and Slotine jl6) . who have presented it as 
a useful tool to analyze some naturally stable systems, as in e.g. [5], as well as for the de- 
sign and stability analysis of non- linear observers, see [161 1131 [21 S]- A potential interest of 
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the contraction property is that the use of a non-EucUdcan metric is often suggested by the 
dynamics and/or by the nonhnear nature of the state-space. 

A relevant example of non- Euclidean distance between covariance matrices is the natural 
distance of the cone of positive definite matrices [12| . It enjoys several invariance properties, 
and appears in information geometry as the Fisher distance between covariance matrices |21] . 
P. Bougerol has proved in [7] that the discrete-time Riccati equation is a strict contraction on 
the cone of positive definite matrices for the natural distance of the cone, see also [IS] . This 
generalizes to some extent the results of G. Birkhoffon contraction properties of positive maps 
on cones (see also [S] for observer design and contraction properties on cones) . Those results 
have been partially extended in continuous time in }13| using the same metric. It seems to 
be the right non-Euclidean metric to analyse the stability and contraction properties of the 
Kalman filter. 

When the dimension of the state space becomes very large, which is e.g. the case for 
discrctized systems described by partial differential equations, the Riccati equation of the 
KF and EKF become numerically intractable due to matrix storage and complexity of the 
updates. Essentially motivated by applications in weather forecasting and oceanography, 
where the covariance matrix can contain up to 10^ unknowns, the idea of projecting the 
state space vector onto a lower dimensional space has appeared several decades ago e.g. [5]. 
This operation reduces the storage and multiplication of matrices of size n x n to matrices 
of size r X n with r ^ n going from quadratic complexity and memory requirement in the 
state space dimension n to linear complexity in n. The low-dimensional subspace is supposed 
to capture the largest eigenvalues of the covariance matrix, where the error needs to be the 
most attenuated. One such method is the so-called SEEK filter [19l [20], which is a low- 
rank version of the Kalman filter (see also the recent paper [llj). In contrast to previous 
methods, the covariance matrix still has a fixed rank but its span is free to evolve at each 
step. Unfortunately, the natural metric is not defined on the boundary of the cone, that 
is, rank-deficient positive semi-definite matrices, and the hitherto analysis of the geometry 
of the Kalman Filter via its contraction properties for the natural metric, does not readily 
apply to low-rank Kalman filters. 

In this paper, a novel low-rank Kalman filter is proposed. The underlying Riccati fiow 
evolves on the manifold of fixed rank symmetric positive semidefinite matrices. Exploiting 
the decomposition of any rank-r positive semi-definite matrix P = URU', where U G R"^'' is 
a matrix whose columns form an orthonormal set, and _R is a r— dimensional positive definite 
matrix, a Riemannian geometry recently introduced by the authors [6| has been shown to 
retain many invariance properties of the natural metric of the cone. As a consequence, 
the low-rank Riccati flow admits contraction properties when the infinitesimal distances are 
measured with this non-Euclidean metric. The contributions of this paper are threefold. 
First a continuous-time version of the contraction properties of the KF for the natural 
metric of the cone is given. This contribution is rather tutorial, as the results of [7] in 
discrete time imply contraction in continuous time. This section can be skipped by a reader 
who would be familiar with contraction theory in continuous-time. The paper contains 
two novel contributions: a novel version of the low-rank Kalman filter defined in a proper 
geometric framework is proposed. And, the low-rank Kalman filter is proved to inherit some 
contractions properties of the Kalman filter for a recently introduced metric on the set of 
fixed-rank positive semi-definite matrices. As a by-product stability properties are proved 
and convergence speed rates are characterized around stable steady-states. 

The paper is organized as follows. In Section 2, the contraction properties of the Kalman 
filter are recalled, and some results of [7] are shown to be provable directly in continuous 
time. In Section 3, a low-rank Kalman filter is introduced. In Section 4, we present the 
natural metric on the set of fixed-rank positive semi-definite matrices recently introduced by 



the authors in |6|. Finahy, Section 5 addresses the contraction properties of the low-rank 
Kalman fiUer. 

1.1 Notation 

• P+(n) is the set of symmetric positive definite n x n matrices. 

• S+(r, n) is the set of symmetric positive semi-definite nxn matrices of rank r < n. We 
will only use this notation in the case p < n. 

• St(r,n) = 0(n)/0(n — r) is the Stiefcl manifold i.e. the set of n x r matrices with 
orthonormal columns: U'^U = Ip. 

• span(^) is the subspace of R" spanned by the columns of ^ £ M"^". 

• TxM is the tangent space to the manifold M at X. 

• A' denotes the transpose of the matrix A. 

2 Contraction properties of the Kalman filter 

Consider the linear time- varying continuous-time system 

f^x{t) = A{t)x{t) + G{t)w{t) 



y(t) = C{t)x{t)+Hit)r,{t), 



(1) 



where x{t) e R", w{t) S M™, y{t), rjit) £ W. The random vectors wit) and ??(<) are 
independent Gaussian white noise with zero mean and covariance matrix equal to the identity. 
The matrices A{t), G{t), C{t) and H{t) have the appropriate dimensions and it is assumed 
that A{t) is invcrtible for all t. 

The classical equations of the Kalman filter define a recursion for 

x{t) = nx{t)\{y{s)}o<s<t] 

the best estimate of the true state x, using the conditional error covariance matrix P{t) 

jx^{A- PC'{HH')-^C)x + PC'iHH'y^y (2) 

—P = $t(P) ^AP + PA' + GG' - PC'(HH')"^CP (3) 

at 

The mapping $t defines the continous matrix-valued Riccati differential Equation, and for 
each P G P+{n), ^t{P) is a tangent vector to P+{n) at P, i.e. $t(P) G TpP+{n). 

2.1 Natural metric of P+{n) 

The geometry of the n-dimensional cone of symmetric positive definite matrices P+ (n) has 
been well-studied in the literature. The group GL{n) acts transitively on this set via the 
following action 

jA:P+{n)^P+{n), P ^ APA' (4) 

for any A G GL{n). If P is the covariance matrix of a gaussian vector of zero mean x, 
then ^a{P) is the covariance matrix of the transformed vector Ax. li P,Q G P+{n) are 
two arbitrary points of the cone there always exists A G GL{n) such that Q ~ ja{P)- 



This property makes P+(n) a so-called homogeneous space under the GL{n) group action. 
The isotropy subgroup is the subgroup of GL(n) stabilizing the identity matrix i.e., {A G 
GL{n), AIA' = /} = 0{n). As a general property of homogeneous spaces, the following 
identification holds 

P+{n)=^GL{n)IO{n) 

The isotropy subgroup being compact, there exists a GL(n)-invariant Riemannian metric on 
P+{n) called the natural metric [12] • This metric is defined as the usual scalar product at 
the identity 

5/(Xi,X2) = Tr(XiX2) 

where Xi,X2 are tangent vectors at the identity, i.e. two symmetric matrices. If Yi,Y2 are 
two tangent vectors at P G P+(n), the group action (|4]) with A = P~^^^ transports them to 
the tangent space at the identity, where P"^/^ is defined as the symmetric square root of P. 
The invariance of the metric then implies 

gp{yi,Y2) = (7/(p-i/2y^p-i/2^p-i/2y^p-i/2) ^ Ti-{p-^YiP-^Y2) 
The associated Riemannian distance is 

n 

1 

where Ai,--- , A„ are the eigenvalues of PQ~^. The main property of this metric is its 
invariance to conjugacy and inversion. For any A £ GL(n) and P^Q G P+{n) 



1p^{n) 



{APA' ,AQA') ^ dp^^,,^{P,Q) ^ dp^(„){p-\Q-^) 



Note that, this metric coincides with the Fisher Information Metric (FIM) for the following 
statistical inference problem: the available observations have a Gaussian distribution with 
zero mean and a covariance matrix parametrized by an unknown matrix in P £ P-^(n). The 
distance distorts the space to measure the amount of information between the distributions. 
For instance dp^i^^^^P^ P + 5P) — >■ oo when P ^> 0, for any 5P >- 0. This is understandable 
from an information point of view, as a Gaussian distribution with mean and covariance 
matrix, carries infinitely more information than a Gaussian with a strictly positive covariance 
matrix. Finally the FIM enjoys invariance properties to reparameterization i.e. x i— >■ Ax in 
the case of mean Gaussian distributions, and thus it is no surprise to see it coincide with 
the natural metric of the cone. 

2.2 Contraction property 

The notion of contraction |16| for a dynamical system described by the fiow -irx — f{x,t) 
can be interpreted as the (exponential) decrease of a the (geodesic) distance between two 
arbitrary points under the fiow. 

Definition 1 Let -^x = f{x,t) be a smooth dynamical system,, defined on a C^ embedded 
manifold M G M equipped with a Riemanian metric denoted by gxivi, V2) on the tangent space 
at X. Let X{x^t) denote the flow associated to f: 

Xix,0)=x, j^X{x,t)^f{X{x,t),t) (5) 



Let V be a tangent vector at x. Let a^{s)^ Q < s < 1 he a geodesic emanating from x, with 
direction v. Let N be a subset of M . The system is called a contraction for the metric g on 
N if for all x 6 N , and v 6 TxM , we have 

d ^ dX{a,{s),t)) dX{a,{s),t)) ^^ ^ ^ dX{a,{s),t)) dX{a,{s),t)) ^^ 

-t:{.9^( ^ , 1 )} < -2A{g^( , )} 6) 

at ds ds ds ds 

where A > 0. //A > the contraction is called strict. 

Wc have the following result [TBI H] • 

Theorem 1 Under the notation and assumptions of Definitions^ let xcXi e N. Assume 
there is a geodesic 7(s), < s < 1 linking Xq and Xi, and for all t £ [0,T] the transported 
geodesic X{'y{s),t) C N for all < s < 1, then if dg is the geodesic distance associated to 
the metric we have 

dg{X{xo,t),X{xi,t)) < e-^'dg{xo,xi), Vt E [0,T] 

which is an exponential decrease if X> 0. 

The proof relies on the fact that the length of the curve X{j{s),t) transported by the flow, 
in the sense of the metric g, is the sum of all the length elements \\dX{'y{s), t)/ds\\g, and the 
infinitesimal property ([6]) can be extended to the length of the entire curve. 

Strict contraction is a natural property to expect from a filter, as it implies the exponential 
forgetting of the initial condition. And it turns out that the discrete-time Riccati flow has 
been proved to be contracting for the natural metric of the cone P-f (n) by Bougerol under 
standard assumptions, and it has also been proved to be a contracting for the same metric in 
continuous-time in |13j . under more restrictive assumptions. Indeed if 6p is a tangent vector 
at P we have the following result 

Lemma 1 Assume that the process noise GG' is stricly positive. Then the Riccati equation 
^ is a contraction in the sense of Definition 1 for the natural metric of the cone. At each 
P we have A = fJ-fprnax where fi is a lower bound on the eigenvalues of GG' and Pmax is an 
upper bound on the eigenvalues of P. 

Proof We have 

^^gASP,SP)^^Triip-^SPr) 

= 2Tiip-^ 5P{—P-^)SP) + 2Tiip-^ 5 PP-^ — 5 P) 

—SP = ASP + SPA' - SPG'(HH')-^GP - PG'iHH'y^GSP 
dt 

d- p-i ^ p-ipp-i ^ _p-i^ _ ^/p-i _ p-iGG'p-i + C'{HH')-^G 



where 



and 



dt 
As a result 



^gp(SP,SP) = -2TT((P-^SPP-^SP)GG'P-^) 
dt 

- 2Tii{p-'^SPP-^SP)G'{HH')-^GP) 
< -2TT{{P-^SPP-^SP)GG'P-^) 



which proves that the Riccati flow is contracting in the sense of ([6]) for the natural metric 
of the cone with A = /i/pmax where fi is a. lower bound on the eigenvalues of GG' , that is 
assumed to be invertible, and Pmax is an upper bound on the eigenvalues of P. 

The contraction property of the Riccati equation is in fact due to the invariances enjoyed 
by the natural metric of the cone. This can be understood the following way: the equation 
writes f^P = AP + PA' + GG' - PG'{HH')-^CP. The two first terms neither expand nor 
contract as it is the differential form of the transformation 7b(P) = (/+tA)P(/+Tj4)' which 
is an isometry for the distance dp_^(„). The addition of a positive matrix -^P = Q where 
Q is a positive matrix is neither expanding nor contracting in the Euclidian space, but it 
contracts for the natural metric as gp tends to dilate distances when P becomes large. Finally 
■^P = —PG'{HH')~^GP is a naturally contracting term which is paramount in the theory 
of the Kalman filter and observers (correction term). We have the following proposition, 
refining the results of |13j in the case of time- independent coefficients: 

Proposition 1 Suppose the matrices A, C, G do not depend on the time t, (A, C) is observ- 
able and G is full-rank. There exists a unique solution Q G P+ (n) of the algebraic Riccati 
equation defined by $((5) = where $ is given by (J3|). For any R G P+(n), let Sr be the ball 
center Q and radius dp in){Qi R) 

Sp. = {p £ p+(7i), dp^(„^{Q,p) < dp^(„)(g,p)} 

Let Mp = sup{||P||2,P G Sp} < cxD, and fi > be the lowest eigenvalue of GG' . Let 
Pi{t),P2{t) be two arbitrary solutions of the Riccati equation jrP = <&(P) initialized in Sp. 
Then Pi(i),P2(i) G Sp for any t >0, and the following contraction result holds 

dp^^^)iPi{t),P2{t)) = e-^*dp^(„)(Pi(0),P2(0)) 
In particular P{t) tends exponentially to the stationnary solution Q for the distance dp^(^n)- 

Proof The proof is a straightforward application of Theorem 1, where we used the fact 
that dp („\ is non-expanding and thus by Theorem 1 Pi (t) , P2 (i) remain in the compact set 
Sp. 

3 Low-rank Kalman filtering 

With ever growing size of measurable data and ever growing complexity of the models, the 
high computing cost and storage requirement of the Kalman filter (because of the need 
to implement the Ricatti equation) can become prohibitive. This has especially been a 
problem in the field of weather forecast and oceanography applications, where Kalman filters 
are a very natural and attractive tool, but where the models rely on discretized partial 
differential equations, and where the state vector as well as the measurement vector can 
become prohibitively large [ID] . Those problems resulted in modified versions of the extended 
Kalman filter where the state vector is projected on a low-dimensional space, see e.g. j9l 
[To] . Note that this filter could also meet applications in Extended Kalman Fitler based 
Simultaneous Localization and Mapping (e.g. dHj), where the Sparse Extended Information 
Filter, which is supposed to retain the largest terms in the covariance matrix, plays an 
important role. 

In order to define covariance matrices of vectors that have been projected onto a subspace 
of dimension r <C n, one mainly thinks of two parameterization of positive semi-definite 
matrices of dimension r, i.e. matrices of S~^{r,n) 

P = ZZ' = UR^U' 



zr 



R 



U^ 



U 



3 



where P G S+{r,n), Z e E^^'', R G P+(r), U G St(r,n) (see figure). The use of the 
first decomposition goes back to the early days of Kalman filtering, motivated by numerical 
stability of the filter. Indeed, updating Z and defining P as ZZ' enforces the constraint of a 
positive symmetrical semi-definite covariance matrix P. The second decomposition UB?U' is 
meaningful from a geometrical perspective and is suited to a simple statistical interpretation: 
URU' is a fiat ellipsoid, U defines the subspace in which the ellipsoid lives, and R defines 
the form of the ellipsoid. Such an ellipsoid is supposed to capture the principal directions 
of variability in the error covariance matrix and R represents the error covariance matrix on 
the lower dimensional subspace. The set S'^ (r, n) has proved to be a Riemannian manifold, 
when equipped with appropriate metrics based on the first decomposition [Ml [TJ , as well as 
the second decomposition |6]. Another interesting metric can also be found in [3]. 

In this paper, we seek to prove that the low-rank Riccati equation inherits some of the 
contraction properties of the full-rank Riccati equation. As the contraction property of the 
Riccati equation has been proved to stem from the invariance properties of the natural metric 
of the cone, we will focus on the second decomposition. Indeed, metric [6] enjoys several 
invariance properties and appears to be the most natural generalization of the natural metric 
of P+{n) to the set S~^{r,n). The matrix factorization 



p = URU' 



(7) 



suggests to rewrite the Riccati equation as 



— URU' = AURU' + URU' A' + GG' - URU'G'iHH'y^CURU' (8) 

at 

However, this equation is not rank-preserving because of the process noise term GG' . Ge- 
ometrically speaking, this means that ^URU' does not belong to the tangent space to the 
manifold S^{r,n) at URU'. To circumvent this difficulty, ^9\ proposes to disregard the 
process noise, i.e. G = so that the update is rank-preserving. However, as seen in Section 
2, process noise has paramount stabilizing properties. This is why it is also proposed in [l9] 
to perturb a little the update with a forgetting factor, in order to recover some stability 
properties. In this paper, we propose a rank-preserving equation including process noise. To 
preserve the rank, the noise can be projected on the low-dimensional space, i.e. G t-^ UG. 
However, the invariance properties discussed in the next section will in fact further restrict the 
noise to be diagonal, i.e. G = ^I, leading to the modified rank-preserving Riccati equation 



-fURU'^ ^ AURU' + URU' A' + ^i^UU' - URU'C'{HH')-^GURU' (9) 

The contraction properties of this low-rank Riccati equation are studied in the next sections. 



4 An invariant metric on the manifold of fixed-rank pos- 
itive semidefinite matrices 

From the previous sections, it seems natural to measure the contraction using the polar 
parametrization P = URU' . However in this decomposition there is a catch, the underlying 
geometry is a quotient geometry because for any orthogonal matrix O G 0{r) we have: 

P = URU' = {UO){0'RO){0'U') (10) 

The representation P ~ URU' with {U, R) e St(r,n) x P+(r) is thus univoque up to the 
equivalence relation {U,R) = {UO,0'R'0) for any O G 0(p). The manifold S'+(r, n) thus 
admits a quotient representation 

5+(r,n) = (St(r,n)xP+(r)/0(r) 

Let us introduce the metric proposed in [6]. Given a representative {U,R) of P G 5'^(r, n), 
the tangent vectors of TpS^{r, n) are given by the pair (A, D) where 

D ^ RDoR 

such that U±_ e St{r,n - r) , U'U±_ = 0, and Dq e Sym(p) = T/P+(r). The chosen metric 
of S'^{r,n) is merely the sum of the infinitesimal distance between subspaces and between 
low-rank positive definite matrices of P-|_(r): 

g^u,R-)ii^i,Di), {A2,D2)) = Tr(Af A2) + TviR-^D.R-^D^R-'), (12) 

generalizing the natural metric of the cone in a natural way. According to [5], the space 
S~^{r,n) = (St(r,n) x P-(_(r)/0(r) endowed with the metric P^ is a Riemannian manifold 
with horizontal space 

•H([/,fl2) = {(A,D) : A = U±B, B e m("-'-)><'', D = RDqR, Dq G Sym(p)} 

Furthermore, the metric proved to inherit some invariancc properties of the natural metric, 
namely invariancc to orthogonal transformations, dilations, and pseudo-inversion. 

5 Contraction properties of the low-rank Kalman filter 

The Riccati low-rank flow ([9]) defines a tangent vector to the manifold. It corresponds to the 
following flow defined in the horizontal space 

^U^{I~UU')AU (13) 

at 

^R ^ AuR + RA'jj + fi^I - RC'jj{HH')-^CuR (14) 

where Au = U'AU, Cu — CU are the matrices projected in span U. 

The system has a triangular structure, that is, the first equation is independent from the 
second one. The dynamics of U are known as the (non symmetric) Oja flow. In turn, R 
verifies a time- varying Riccati equation on the lower dimensional cone P^{r). 

Topological obstructions seem to prevent us from proving the low-rank Kalman filter is 
a global contraction on S+(r, n). Indeed, even in the simplest case r — 1 and n = 2, the fiow 



p^ defines a vector field that can not be a contraction on the whole circle St(l,2). This 
would imply there exists a globally converging vector field on the circle which is impossible. 
Thus, global contraction seems to be out of reach, and we are going to prove instead local 
stability and contraction around any dominant invariant subspace. 

5.1 Local contraction of subspace flows 

The flow (fT^l) has been well studied under the assumption A = A' . In this case, the flow is the 
gradient flow of the generalized Rayleigh quotient cost function r^(f/) = Tr{U'AU) for the 
natural metric on the Stiefeld manifold St{r,n). Because the cost is invariant to rotations, 
the gradient flow can also be interpreted as a gradient flow on the Grassmann manifold 
Gr(r, n) w St(r, n)/0(r), i.e. the manifold of j'-dimensional subspaces in a ri-dimensional 
ambiant space. This subspace flow has been studied by Oja [18] and by Wei-Yong, Hclmke 
and Moore [J^. If the eigenvalues of A are distinct, the flow converges for almost every initial 
condition to the dominant subspace of A, i.e. the eigenspace associated with the r largest 
eigenvalues of A. 

For any matrix A, local exponential convergence of the dominant eigenspace is ensured 
provided that there is a gap between the r first eigenvalues of the symmetric part of A, and 
the others, as shown by the following lemma. 

Lemma 2 The matrix (A + A')/2 is real symmetric and can be diagonalized. Let Xjj be the 
gap between the r-th and the r-\- l-th eigenvalues. Let Ur € St(r, n) represent an orthonormal 
basis of the the subspace spanning the r dominant eigenvalues of the symmetric part of A. If 
Xij > 0, Ur is an exponentially stable equilibrium of the subspace flow psp in the sense of 
the Grassman metric. 

Proof Let U± span the orthocomplement of Ur and SU = U±K with K e R("^'')^'". Ur is 
an equilibrium, and the first-order variation of (|13p reads 



which implies 



— SU =(I- UrU'r)A5U - SUU'AUr - UrWAUr 

at 

= 2Ty{SU'ASU - SU'SUUrAUr) 
= TT{6U'iA + A')SU - 5U'5UU'r{A + A')Ur) 
= Tr(7f'[/_i(yl + A'U^K - K'KU^{A + A'Ur) 
< ~2XuTt{KK') < -2\uTt{SU'SU) 

We thus see that every solution that converges to the dominant eigenspace of the symmet- 
ric part of A, is eventually contracting for the natural metric of the Grassmann (subspace) 
manifold. 

5.2 Local contraction of low-rank Riccati flow 

Here we show the low-rank Riccati flow is locally contracting around the dominant eigenspace 
of the symmetric part of matrix A. 

Proposition 2 Suppose there is a gap between the r-th and r + l-th eigenvalue of A + A' , and 
let Ur G St(r,n) span the r-dominant subspace. Then there exists an open set V G St{r.,n) 
containing Ur such that the dynamics ()13|) - (|14p is a strict contraction for the metric (|12|) on 
N = K X V C S^ (r, n) , where K is any compact set of P+ (r) . 



Proof Let {SU, 6R) E T~L(Ur-,R'^)- From Lenima[2l and by continuity, there exists A[/ > such 
that (|13|) is a strict contraction with rate \u locally around Ur- The contraction property 
of the flow (fT4| is a direct consequence from Lemma 1 which yields ■^Tt{{R~^SR)^) < 
— 2Ai{Tr((i?^^(5_R)^) with Xu = /x/rmax where rmax is an upper bound on the eigenvalues 
oi R G K. The contraction property of the low-rank Kalman filter stems directly from the 
definition of the proposed metric on S+(r,n), since 

^[Tt{SU'SU) + Tr((i?-Mi?)2)] < ^2XuTr{SU'SU) - 2\rTt{{R-^5R)^) 

and thus the flow is contracting in the sense of Definition 1 in the subset N with rate 
A = min(Ac/, A_r). 

The latter proposition proves that around the dominant subspace oi A + A' , the low-rank 
Kalman filter defines a contraction for the proposed metric on S^(r, n). In particular, the 
speed of convergence is characterized by a known rate A. 

5.3 Eventual contraction in the case of symmetric A 



As mentionned above, system ([T3| - (|T4)) has a triangular structure. The dynamics of U is 
independent of the dynamics of R, and under the assumption that A = A' and the eigenval- 
ues of A are distinct, the fiow converges for almost every initial condition to the dominant 
subspace of A. Once U{t) has converged to C/qo = Ur, the matrices Au and Cu are projec- 
tions of A and C on a fixed subspace Ur- As a result, if A and C are time- independent, ([Ti)) 
asymptotically becomes a Riccati equation with time-independent coefficients, implying con- 
vergence of R{t) from Proposition 1, as soon as [Au^ , Cjj^) is observable. Those two latter 
facts allow to conclude the cascaded system (fTS jl - ifMl) converges to fixed values {Uoo,Roo), as 
explosions in finite time can not occur (it is easily checked that the trace of R can grow at 
most exponentially). 

Proposition [5] implies that the low-rank Riccati flow is a contraction in a neighborhood 
of (Uoc, Roo)- We can thus conclude, under the above assumptions on A, that for almost any 
initial condition, the flow is eventually contracting for the proposed metric (I12p . i.e. after a 
finite time T > 0, the system p ^ - p4|) is a strict contraction. 

Remark 1 The complete filter also involves the equation for the estimated mean x of x, that 
writes -ix = {A — KC)x + Ky with K = URU'C {HH')~^ . Its convergence properties are 
a consequence of the contraction properties of the low-rank Riccati equation. Indeed, as the 
gain K asymptotically converges, it can be proved the mean error x — x projected on the final 
subspace spanned by Uoo converges to zero under the additional assumption that the output 
map satisfies CUooU^ ~ C. 

5.4 A counter-example for skew-symmetric A 

Convergence properties are necessarily weaker in the case of a nonsymmetric matrix A, as 
illustrated by the following example. Consider (|13|) on St(l,3) with A skew-symmetric. Then 
[/ = a; e E'^ is a vector on the unit sphere and it satisfies 

d 

-—X — oj A X 
dt 

for some oj G M^. It is clear that this fiow docs not possess any contraction property as an 
initial angle between two vectors remains unchanged by the flow, and thus the distance in 
the Stiefel manifold is constant over time. 
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For A G M"^" such that A — A' j^ 0, the skew symmetric part of A induces a rotation of 
the subspace spanned by U, and one should not expect neither convergence, nor contraction. 
It seems reasonable to hope for ultimate contraction, even with general A, as soon as the 
associated full-rank equation converges to a stationnary state. Proving this statement is 
beyond the scope of the present paper. 

5.5 Proposed implementation in discrete-time 

In order to meet the implementation constraints, the proposed filter must be written in 
discrete time. The transposition is not straightforward, as the rank constraint must be 
preserved numerically at each step, i.e. the updated covariance matrix must remain on the 
manifold. It means the updated U must have orthonormal columns, and the updated R must 
be positive definite. The so-called retractions (e.g., [1]) conveniently map the tangent space 
at any point, to the manifold. Using the retractions adopted in the work [1] [17], and the 
usual discrete Riccati equation we propose the following implementation in discrete time: 

[/+ = qi{AU - dt{UU'AU)) 

R+ = AuRA'jj - AuRC[j{CuRC[j + dtHH'y^CuRA'jj + dtf?I 

where A = A + dtl and qf() extracts the orthogonal factor in the QR decomposition of its 
argument. Note this latter operation boils down to orthonormalizing the factor U at each 
step. 

6 Conclusion 

In this paper we have analyzed contraction properties of the low-rank Kalman filter via a 
recently introduced metric that extends the natural metric (or Fisher information metric) 
to the facets of the cone of positive definite matrices. In the process, the low-rank Kalman 
filter equations were a little modified and put on firm geometrical ground. Here are some 
concluding remarks and challenges that could deserve attention: 

• In discrete-time [7], the contraction rate of the Riccati flow for the natural metric 
of P+(n) only depends on the coeflficients A,B,C, D. However, in continuous-time it 
depends on the point P G P+(n) at which it is evaluated (see Lemma 1). This difference 
is a little surprising and would deserve to be better understood. Note also, that if one 
can prove simply there is an arbitrary large invariant set for the flow under detectability 
and observability conditions. Lemma 1 implies the contraction rate is uniform and hence 
convergence by the flxed point theorem. 

• Thus, one could hope to prove stronger results on the contraction properties of the 
low-rank Riccati flow in discrete-time. However, computing explicitely the distance 
between two low-rank matrices with the metric proposed remains an open problem. [6] 
proposes special curves in S^(r,n) that approximate the geodesies. Proving stronger 
results on the contraction of thoses special curves under the Riccati flow seems to be 
straightforward enough. 

• In this paper we proved contraction properties of the flow on a (large enough) subset 
of the manifold. Although global contraction seems out of reach because of topological 
obstructions, we proved "eventual contraction" (i.e. contraction after some finite time 
for almost any initial condtion) in the special case where A is symmetric. Without this 
assumption, Oja flow, and thus the low-rank Riccati flow, do not necessarily converge. 
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• The filter proposed in this paper requires a reduced number of numerical operations and 
storage capacity. It would be of interest to test its efficiency in a particular large-scale 
application and to evaluate its relative merits with respect to alternative approaches 
that focus on sparsity of the covariance matrix (e.g. |23j). 
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